Content-independent duration model on categories of voice and unvoice segments

نویسنده

Oleg P. Skljarov

چکیده

Trying to understand the experimental data on segmentation of a speech signal by a principle "Voice/Unvoice" has led us to the hypothesis about a pair of logistical dependence between durations of these segments. The segmentation was carried out with the help of the computer program working in quasi real time. The hypothesis about logistic recurrent dependence for sequence of segments durations has allowed to make a conclusion about quasi rhythmical organization of this sequence. With the help of offered recurrent dependences it is possible to explain statistical peculiarities of speech behaviour of stutterers in comparison with normal speech behaviour. These logistic dependences were confirmed by direct experimental data. The assumption of origins of specified rhythm is made. These origins are hidden at the level of control of speech production and perception. Is shown, that the chaotic nature of offered dynamics of formation of large-scale temporary structure allows to enter concept of the information into consideration by a natural way.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Business Model of Sports Academies with an Emphasis on Value Proposition and Customer Segments

Background. Nowadays, sport is considered as a good base for marketing and entrepreneurship. Business model design is also increasingly welcomed. But in the field of sports businesses and sports academies in Iran, no research has been conducted, and no specific business model has been introduced. Objectives. The purpose of this study is to identify and prioritize the value proposition componen...

متن کامل

Comparison of Modeling Target in LSTM-RNN Duration Model

Speech duration is an important component in statistical parameter speech synthesis(SPSS). In LSTM-RNN based SPSS system, the speech duration affects the quality of synthesized speech in two aspects, the prosody of speech and the position features in acoustic model. This paper investigated the effects of duration in LSTM-RNN based SPSS system. The performance of the acoustic models with positio...

متن کامل

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

Blind Voice Separation Based on Empirical Mode Decomposition and Grey Wolf Optimizer Algorithm

Blind voice separation refers to retrieve a set of independent sources combined by an unknown destructive system. The proposed separation procedure is based on processing of the observed sources without having any information about the combinational model or statistics of the source signals. Also, the number of combined sources is usually predefined and it is difficult to estimate based on the ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Content-independent duration model on categories of voice and unvoice segments

نویسنده

چکیده

منابع مشابه

The Business Model of Sports Academies with an Emphasis on Value Proposition and Customer Segments

Comparison of Modeling Target in LSTM-RNN Duration Model

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Blind Voice Separation Based on Empirical Mode Decomposition and Grey Wolf Optimizer Algorithm

عنوان ژورنال:

اشتراک گذاری